24 research outputs found

    Sustainable Software Ecosystems: Software Engineers, Domain Scientists, and Engineers Collaborating for Science

    Full text link
    The development of scientific software is often a partnership between domain scientists and scientific software engineers. It is especially important to embrace these collaborations when developing advanced scientific software, where sustainability, reproducibility, and extensibility are essential. In the ideal case, as discussed in this manuscript, this brings together teams composed of the world's foremost scientific experts in a given field with seasoned software developers experienced in forming highly collaborative teams working on software to further scientific research.
    Comment: 4 pages, submission for WSSSPE

    Sustainable Software Ecosystems for Open Science

    Full text link
    Sustainable software ecosystems are difficult to build and require concerted effort, community norms, and collaboration. In science it is especially important to establish communities in which faculty, staff, students, and open-source professionals work together and treat software as a first-class product of scientific investigation, just as mathematics is treated in the physical sciences. Kitware has a rich history of establishing collaborative projects in the science, engineering, and medical research fields, and continues to improve that model as new technologies and approaches become available. This approach closely follows, and is enhanced by, the movement toward open, reproducible research in the sciences, where data, source code, methodology, and approach are all available so that complex experiments can be independently reproduced and verified.
    Comment: Workshop on Sustainable Software: Practices and Experiences, 4 pages, 3 figures

    Summary of the First Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE1)

    Get PDF
    Challenges related to the development, deployment, and maintenance of reusable software for science are a growing concern. Many scientists' research increasingly depends on the quality and availability of the software upon which their work is built. To highlight some of these issues and share experiences, the First Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE1) was held in November 2013 in conjunction with the SC13 Conference. The workshop featured keynote presentations and a large number (54) of solicited extended abstracts that were grouped into three themes and presented via panels. A set of collaborative notes of the presentations and discussion was taken during the workshop. Unique perspectives were captured about issues such as comprehensive documentation, development and deployment practices, software licenses, and career paths for developers. Attribution systems that account for evidence of software contribution and impact were also discussed. These include mechanisms such as Digital Object Identifiers, publication of "software papers", and the use of online systems, for example source code repositories like GitHub. This paper summarizes the issues and shared experiences that were discussed, including cross-cutting issues and use cases. It joins a nascent literature seeking to understand what drives software work in science, and how that work is affected by the reward systems of science. These incentives can determine the extent to which developers are motivated to build software for the long term and for the use of others, and whether they work collaboratively or separately. It also explores community building, leadership, and dynamics in relation to successful scientific software.

    The Quixote project: Collaborative and Open Quantum Chemistry data management in the Internet age.

    Get PDF
    Computational Quantum Chemistry has developed into a powerful, efficient, reliable and increasingly routine tool for exploring the structure and properties of small to medium sized molecules. Many thousands of calculations are performed every day, some offering results which approach experimental accuracy. However, in contrast to other disciplines, such as crystallography or bioinformatics, where standard formats and well-known, unified databases exist, this QC data is generally destined to remain locally held in files which are not designed to be machine-readable. Only a very small subset of these results will become accessible to the wider community through publication.

    In this paper we describe how the Quixote Project is developing the infrastructure required to convert output from a number of different molecular quantum chemistry packages to a common, semantically rich, machine-readable format and to build repositories of QC results. Such an infrastructure offers benefits at many levels. The standardised representation of the results will facilitate software interoperability, for example making it easier for analysis tools to take data from different QC packages, and will also help with archival and deposition of results. The repository infrastructure, which is lightweight and built using Open software components, can be implemented at individual researcher, project, organisation or community level, offering the exciting possibility that in future many of these QC results can be made publicly available, to be searched and interpreted just as crystallography and bioinformatics results are today.

    Although we believe that quantum chemists will appreciate the contribution the Quixote infrastructure can make to the organisation and exchange of their results, we anticipate that greater rewards will come from enabling their results to be consumed by a wider community. As the repositories grow they will become a valuable source of chemical data for use by other disciplines in both research and education.

    The Quixote project is unconventional in that the infrastructure is being implemented in advance of a full definition of the data model which will eventually underpin it. We believe that a working system which offers real value to researchers, based on tools and shared, searchable repositories, will encourage early participation from a broader community, including both producers and consumers of data. In the early stages, searching and indexing can be performed on the chemical subject of the calculations and on well-defined calculation metadata. The process of defining more specific quantum chemical definitions, adding them to dictionaries and extracting them consistently from the results of the various software packages can then proceed in an incremental manner, adding additional value at each stage.

    Not only will these results help to change the data management model in the field of Quantum Chemistry, but the methodology can be applied to other pressing problems related to data in computational and experimental science.
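    To make the conversion idea concrete, the sketch below shows the core pattern in miniature: parse one value out of a package-specific log file and emit a common, machine-readable record that a repository could index. Quixote itself targets a semantically rich format built on dictionaries; the regular expression, file name and record fields here are illustrative assumptions, not Quixote's tooling or schema.

    import json
    import re

    # Assumed pattern for a final energy line such as:
    #   "Total SCF energy =    -76.026765"
    ENERGY_RE = re.compile(r"Total .*energy\s*=\s*(-?\d+\.\d+)")

    def output_to_record(path, package, formula):
        """Convert one QC output file into a common, JSON-serialisable record."""
        energy = None
        with open(path) as fh:
            for line in fh:
                match = ENERGY_RE.search(line)
                if match:
                    energy = float(match.group(1))  # keep the last occurrence
        return {
            "package": package,            # producing code, e.g. "NWChem"
            "formula": formula,            # chemical subject, usable for indexing
            "totalEnergyHartree": energy,  # None if no energy line was found
        }

    # Hypothetical usage:
    # record = output_to_record("water_scf.out", "NWChem", "H2O")
    # print(json.dumps(record, indent=2))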

    Self-driving Multimodal Studies at User Facilities

    Full text link
    Multimodal characterization is commonly required for understanding materials. User facilities possess the infrastructure to perform these measurements, albeit serially over days to months. In this paper, we describe a unified multimodal measurement of a single sample library at distant instruments, driven by a concert of distributed agents that use analysis from each modality to inform the direction of the other in real time. Powered by the Bluesky project at the National Synchrotron Light Source II, this experiment is a world first for beamline science, and provides a blueprint for future approaches to multimodal and multifidelity experiments at user facilities.
    Comment: 36th Conference on Neural Information Processing Systems (NeurIPS 2022), AI4Mat Workshop
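    The agent pattern described here can be sketched in a few lines: measure, analyse, and let the analysis choose the next measurement. The Python below drives Bluesky's RunEngine with the simulated motor and detector shipped in ophyd.sim; the suggest_next() heuristic and the seed data are placeholder assumptions, not the paper's agents or the NSLS-II deployment.

    from bluesky import RunEngine
    from bluesky.plans import scan
    from bluesky.callbacks import LiveTable
    from ophyd.sim import motor, det  # simulated hardware for demonstration

    RE = RunEngine({})

    def suggest_next(history):
        """Toy agent: focus the next scan around the best point seen so far."""
        best = max(history, key=lambda point: point[1])[0]
        return best - 0.5, best + 0.5

    history = [(-1.0, 0.1), (0.0, 1.0)]  # assumed (position, signal) seed data
    lo, hi = suggest_next(history)

    # One adaptive step: scan the simulated motor over the suggested window.
    RE(scan([det], motor, lo, hi, 11), LiveTable(["motor", "det"]))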

    Open Data, Open Source and Open Standards in chemistry: The Blue Obelisk five years on

    Get PDF
    Background: The Blue Obelisk movement was established in 2005 as a response to the lack of Open Data, Open Standards and Open Source (ODOSOS) in chemistry. It aims to make it easier to carry out chemistry research by promoting interoperability between chemistry software, encouraging cooperation between Open Source developers, and developing community resources and Open Standards.
    Results: This contribution looks back on the work carried out by the Blue Obelisk in the past five years and surveys progress and remaining challenges in the areas of Open Data, Open Standards, and Open Source in chemistry.
    Conclusions: We show that the Blue Obelisk has been very successful in bringing together researchers and developers with common interests in ODOSOS, leading to the development of many useful resources freely available to the chemistry community.
    Peer Reviewed
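    Purely as an illustration of what such Open Standards buy in practice (this example is not from the paper), an open-source cheminformatics toolkit can round-trip a structure between community formats such as SMILES, InChI and the MDL Molfile; RDKit is used here, with ethanol as an arbitrary example molecule.

    from rdkit import Chem

    mol = Chem.MolFromSmiles("CCO")  # parse a SMILES string (ethanol)
    print(Chem.MolToInchi(mol))      # InChI=1S/C2H6O/c1-2-3/h3H,2H2,1H3
    print(Chem.MolToMolBlock(mol))   # MDL Molfile, readable by other open tools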

    The physical and structural properties of thiol encapsulated gold nanoparticle Langmuir-Schaeffer films

    No full text
    EThOS - Electronic Theses Online Service, United Kingdom

    Open Chemistry, JupyterLab, REST, and quantum chemistry

    No full text

    Open chemistry: RESTful web APIs, JSON, NWChem and the modern web application.

    No full text
    An end-to-end platform for chemical science research has been developed that integrates data from computational and experimental approaches through a modern web-based interface. The platform offers an interactive visualization and analytics environment that functions well on mobile, laptop and desktop devices. It offers pragmatic solutions to ensure that large and complex data sets are more accessible. Existing desktop applications/frameworks were extended to integrate with high-performance computing resources, and command-line tools are provided to automate interaction, connecting distributed teams to the software platform on their own terms. The platform was developed openly, with all source code hosted on GitHub and automated deployment possible using Ansible coupled with standard Ubuntu-based machine images deployed to cloud machines. The platform is designed to enable teams to reap the benefits of the connected web, going beyond what conventional search and analytics platforms offer in this area. It also has the goal of offering federated instances that can be customized to the sites/research performed. Data is stored using JSON, extending upon previous approaches that used XML, with structures that support computational chemistry calculations. These structures were developed to make it easy to process data across different languages and to send data to a JavaScript-based web client.
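    As a rough illustration of the JSON-centred design (the field names below are assumptions, not the platform's actual schema or REST API), a calculation can be captured as a plain record that any language, including a JavaScript web client, can consume:

    import json

    calculation = {
        "chemical": {"formula": "H2O"},
        "atoms": {
            "elements": ["O", "H", "H"],
            "coords": [[0.000, 0.000, 0.119],
                       [0.000, 0.763, -0.477],
                       [0.000, -0.763, -0.477]],
        },
        "calculation": {"code": "NWChem", "theory": "DFT", "energy": -76.42},
    }

    print(json.dumps(calculation, indent=2))  # trivially parsed client-side

    # A federated instance might accept uploads over REST; this endpoint is
    # hypothetical, not the platform's documented API:
    # requests.post("https://example.org/api/v1/calculations", json=calculation)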

    The Visualization Toolkit (VTK): Rewriting the rendering code for modern graphics cards

    Get PDF
    The Visualization Toolkit (VTK) is an open source, permissively licensed, cross-platform toolkit for scientific data processing, visualization, and data analysis. It is over two decades old, and was originally developed for a very different graphics card architecture. Modern graphics cards feature fully programmable, highly parallelized architectures with large core counts. VTK's rendering code was rewritten to take advantage of modern graphics cards while maintaining most of the toolkit's programming interfaces. This offers the opportunity to compare the performance of the old and new rendering code on the same systems/cards. Significant improvements in rendering speeds and memory footprints mean that scientific data can be visualized in greater detail than ever before. The widespread use of VTK means that these improvements will reap significant benefits.
    Keywords: Visualization, Toolkit, Data analysis, Scientific data
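    Because the rewrite preserved most of the public interfaces, typical user code is unchanged across the old and new backends. The following is standard VTK Python usage (not code from the paper) showing that user-facing pipeline:

    # Standard VTK pipeline: source -> mapper -> actor -> renderer -> window.
    import vtk

    sphere = vtk.vtkSphereSource()
    sphere.SetThetaResolution(64)
    sphere.SetPhiResolution(64)

    mapper = vtk.vtkPolyDataMapper()
    mapper.SetInputConnection(sphere.GetOutputPort())

    actor = vtk.vtkActor()
    actor.SetMapper(mapper)

    renderer = vtk.vtkRenderer()
    renderer.AddActor(actor)

    window = vtk.vtkRenderWindow()
    window.AddRenderer(renderer)
    window.Render()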